Skip to main content

Corporation Websites Collection

This collection contains an extracted web archive corpus of 0.8+ million corporate websites (from an original list of ~0.98 websites) extracted from the archive.org web archive, covering the period 1996 to early 2017. This corpus was originally created as a collaboration between the Internet Archive and a group at Dartmouth University, but it may be useful to other researchers.



rss RSS

Show sorted alphabetically
Show sorted alphabetically
SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
NHK-or-jp-EXTRACTION_GWB-20200211170714
NHK-or-jp-EXTRACTION_GWB-20200211170714
collection
41
ITEMS
211
VIEWS
collection
eye 211
(W)ARC exraction of nhk.or.jp including subdomains and embeds from Nov 1996 to Feb 11, 2020 (until Dec 2019 from GWB snapshot 2020-01-18 00:27:04, from Jan 2020 from GWB snapshot 2020-01-11 17:07:14).
Corporation Websites Collection
data
eye 3,229
favorite 0
comment 0
Corporation Websites Collection
data
eye 1,877
favorite 0
comment 0
Corporation Websites Collection
data
eye 1,848
favorite 0
comment 0
Corporation Websites Collection
data
eye 2,259
favorite 0
comment 0
Corporation Websites Collection
data
eye 2,160
favorite 0
comment 0
Corporation Websites Collection
data
eye 1,323
favorite 0
comment 0
Corporation Websites Collection
data
eye 3,235
favorite 0
comment 0
Corporation Websites Collection
data
eye 1,239
favorite 0
comment 0
Corporation Websites Collection
data
eye 1,260
favorite 0
comment 0
Corporation Websites Collection
data
eye 1,480
favorite 0
comment 0
Corporation Websites Collection
data
eye 1,305
favorite 0
comment 0
Corporation Websites Collection
data
eye 1,765
favorite 0
comment 0
Corporation Websites Collection
data
eye 1,971
favorite 0
comment 0
Corporation Websites Collection
data
eye 697
favorite 0
comment 0
Corporation Websites Collection
data
eye 989
favorite 0
comment 0